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Abstract 

We demonstrate the effects of embedding subgraphs in a Boolean network, which 
is one of the discrete dynamic models for transcriptional regulatory networks. 
After comparing the dynamic properties of networks embedded with seven dif- 
ferent subgraphs including feedback and feedforward subgraphs, we found that 
complexity of the state space increases with longer lengths of attractors, and the 
number of attractors is reduced for networks with more feedforward subgraphs. 
In addition, feedforward subgraphs can provide higher mutual information with 
lower entropy in a temporal program of gene expression. Networks with the other 
six subgraphs show opposite effects on network dynamics. This is roughly consis- 
tent with Thomas's conjecture. These results suggest that feedforward subgraph 
is favorable local structure in complex biological networks. 
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1 Introduction 



Complex networks of interacting elements arising in biological, sociological, and 
physical areas can often be abstracted to graphs or networks. Recent studies of 
networks [112] . including transcriptional regulatory networks in cells, have re- 
vealed at least two statistical properties: power-law connectivity distributions 
having a small number of highly connected nodes; highly clustered connections 
among adjacent nodes |3l4j . The last local structures, called subgraphs or mo- 
tifs, consist of a few nodes and edges among the nodes that are statistically 
significant, and can be regarded as functional modules [4]. Since feedback and 



feedforward subgraphs are basic and ubiquitous circuits in man-made systems, 
one can expect that transcriptional regulatory networks also have both feedback 
and feedforward subgraphs. However, only the feedforward subgraphs prevail [3]. 
Other biological networks such as signal transduction and neuronal networks 
have similar tendencies. This suggests that feedforward subgraphs are favored 
in complex biological networks. In general, although the accumulated data of 
complex networks underlies the statistical significance, it is unclear why feed- 
forward subgraphs are advantageous over other subgraphs in biological systems. 
We therefore constructed Boolean networks and embedded subgraphs in them 
to investigate their effects on both network structure and dynamics. 



2 Model and method 
2.1 Boolean network 

The dynamics of the Boolean networks |5|6j is determined by the equation 

Xi(t + i)=Bi\X{t)] (i = l,2,...,N), (1) 

where Xi(t) is a binary state, either or 1, of node i at time t, Bi(-) are Boolean 
functions [see Table [I] used to update the state of node i, and X(i) is a binary 
vector that gives the states of the N nodes in the network. After assigning the 
initial states X(0) to the nodes, their successive states are updated by input 
states coming from upstream nodes and their Boolean functions. The dynamic 
behavior of these networks is represented by a time series of binary states. The 
time course follows a transient phase from its initial state until it establishes a 
periodic pattern, called an attractor [Fig. [2j. 

Table 1. 16 Boolean functions with indegree Ki n = 2. In this paper, we used 
only No. 1, 2, 4, and 8 Boolean functions shown below, because of biological bias 
for the Boolean functions [7|8|9|10| and the feasibility of computation. 



Inputs 


Output 




1 

1 
1 1 


00000000111 1 1 1 1 1 
0000111100001111 
0011001100110011 
0101010101010101 


Type No. 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 



2.2 Numerical condition 



To investigate the effects of embedding subgraphs on the Boolean network dy- 
namics, we randomly constructed many networks with varying numbers of in- 
dependent subgraphs. Seven subgraphs consisted of three nodes and more than 



three directed edges [Fig. [T] and Table [5] . After embedding the specified number 
of subgraphs, the rest of the directed edges were assigned at random. 



Feedforward loop Feedback loop 




Fig. 1. Seven different subgraphs (local structures), The subgraphs consist of 3 
nodes (squares) and more than 3 directed edges (arrows). Subgraphs A and B 
have three internal edges. Subgraphs C - F have four and G has five internal 
edges. The number of cyclic loops increases with the number of internal edges. 
The seven subgraphs can be divided into two groups; subgraphs C, D, E, and G 
comprise subgraph A, feedforward loop; subgraphs B and F do not. 



Table 2. Numerical condition: All constructed networks consist of the same 
amount of network resources, i.e., nodes, directed edges, and Boolean functions. 
Please note that the difference among the generated networks lies in the style of 
their connections. 



Size of networks, N 


128 nodes 




Connectivity 


For all nodes, Kin = Kout = 


2 


Boolean function 


Only AND style [Table [T] 




Types of subgraph 


Seven different subgraphs [Fig 


m 


Number of embedded subgraphs 


0, 10, 20, 30, and 40 




Number of edges 


256 




Number of initial states 


2000 per network 




Number of realizations 


More than 3 x 10 3 in each condition 



2.3 Path length 



To obtain the structural changes of the propagating pathway of the state vari- 
ables after embedding subgraphs, we measured the average path length [TT] , 
which is the average number of the path lengths for all the nodes [Fig. [3b], 
where the path length is the average number of directed edges in the shortest 
path from a node to all reachable nodes. 



2.4 Complexity of state space 

We use two measures to characterize the complexity of state space from the 
initial states [See Fig. [2] : 

1. Basin entropy [11112] : 

H Basin = ~^2p(i) l0g 2 p(i) (2) 

i 

where, the p(i) satisfies ^2 i p(i) — 1, p(i) — 2§oo' anc ^ ai * s ^ e num ber of 
initial states that reached the i-th attractor [Fig. [5] . 

2. Sum of the length of attractors: Each network may contain a different number 
of attractors, and their lengths may vary. This measure defines the total 
length of attractors in state space. 

The two measures indicate the complexity of state space from the initial states. 
According to the definitions, the larger values of two characteristics sgnify a 
higher complexity of the state space [Figs. [2] and 0]. 




Fig. 2. Example of a state space, There are 2 128 (~ 10 39 ) different states (shown 
as circles) in the space of each network. The numbers 3 and 7, inside the at- 
tractors, indicate the attractor lengths. We applied 2000 different initial states 
(shaded circles) to each network [Table [2]. t = 0, 1,...., indicate time steps of Eq. 
©■ 



2.5 Entropy and mutual information 

We measured the entropy (randomness) and mutual information (correlation) 
of state variables to characterize the temporal structure of state variables in the 
Boolean networks 6 13 14]. Both dynamic properties are obtained from attrac- 
tors [Figs. [2 and E]. 




Specified number of Specified number of 

embedded subgraphs embedded subgraphs 



Fig. 3. Structural properties of embedded networks, a: Relationship between the 
specified number of embedded subgraphs and the actual number of embedded 
subgraphs. Dashed line is given by y = x. b: Relationship between the number 
of embedded subgraphs and average path length. Symbols indicate mean and 
error bars show SD. Different capital alphabets indicate different subgraphs [See 
Fig. Hj. 



3 Results 

Since Fig. [3^ shows that our method was successful in embedding subgraphs 
in Boolean networks, the horizontal axis of Figs. |3J [5^, and[Sj3 are the specified 
number of embedded subgraphs. Because the number of inter-subgraph edges de- 
crease as the number of internal edges increase, the average path length prolongs 
as the number of embedded subgraphs increases [Fig. [1] and [3b] . 

We examined the effects of embedding subgraphs on the graphic and dynamic 
properties. Figure S] shows the complexity of state space structures. Feedforward 
has opposite effects from the other six subgraphs. Note that the number of at- 
tractors, basin entropy, and the sum of the lengths of attractors decrease slightly 
with feedforward subgraphs. In general, the complexity of state space increases 
as the number of internal edges increase [See Fig. [I]. The temporal structure of 
state variables is shown in Fig. \5\ The entropies in Fig. [5^ exhibit similar ten- 
dencies to those shown in Fig. |U Unlike Figs. SlandE^,, the amount of mutual 
information increases with an increase in the number of embedded subgraphs 
[Fig. [5b] ■ Figure Et shows rearranged results from Fig.EK and[5jD, indicating the 
productivity of mutual information (correlation) from entropy (randomness). 

4 Discussion 

Based on resultant dynamics of networks as shown in Figs. [Handle the effects 
of embedding subgraphs in Boolean networks can be divided into two groups. 
Networks with more subgraphs, excluding feedforward, show a larger number 
of attractors and greater entropy and mutual information, demonstrating that 
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Fig. 4. Complexity of state space structure. The relationship between the num- 
ber of embedded subgraphs and the number of attractors (a), basin entropy 
(b) [See Eq.([2])], and sum of the length of attractors (c). The symbols indicate 
mean values. Different capital alphabets indicate different subgraphs [Fig. [T]. 
Dashed lines indicate the maximum value based on our condition [Table [2]. The 
curves approaching the dashed lines indicate underestimates of the numbers of 
attractors. 



the six subgraphs increase the complexity of the state space of the networks. 
In other words, the subgraphs behave as destabilizers of state space, pattern 
generators of a temporal program of gene expression, or entropy generators. The 
resultant mutual information (correlation) is driven by entropy as shown in the 
six positive slopes in Fig. [5fc. These results are consistent with the Thomas's 
and Sontag's conjecture [15116117] because the reciprocal edges in a subgraph 
contribute to increasing of cyclic loops in the subgraph [Fig. [Tj . 

On the other hand, networks with more feedforward subgraphs show a smaller 
number of attractors and less entropy, but greater mutual information. This 
indicates that feedforward subgraphs stabilize the state space as well as organize 
temporal patterns with less entropy, as shown by the negative slope in Fig. 0:. 
Detailed analyses with differential equations [18119] show feedforward loops are 
robust to variations in biochemical parameters and work as a low-pass filter. 
Together with our numerical results and related work [11117] , this suggests that 
feedforward loops are favorable local structure in complex biological networks. 

Actual complex biological networks are established based on emergence and 
evolutionary processes, and the resultant structure has many statistical features. 
Here we concentrate on the effects of embedding subgraphs in Boolean networks. 
The constructive approach [6|11|13|14|17] promises to provide insight into the 
prediction of relationships between network structures, behaviors, and functions. 

4.1 Control parameters for Boolean networks 

The control parameters for the dynamics of Boolean networks are input connec- 
tivity, Ki n , the size of network, the bias of the Boolean functions, and output 




Fig. 5. Temporal structure of state variables. The relationship between the num- 
ber of embedded subgraphs and the amount of entropy (a), the amount of mutual 
information (b), and rearranged data from a and b, (c). The symbols indicate 
mean values. Different capital alphabets indicate different subgraphs [Fig. [T] . 



connectivity distributions [6 20 21j. In this report, we change only the connec- 
tion style, while maintaining the same amount of network resources [Table [2]. 
Figures 2] and [5] demonstrate that the internal connection style, such as the num- 
ber of reciprocal edges or cyclic loops, may well be regarded as a novel control 
parameter for the dynamics of Boolean networks. 

4.2 Differences in correlation productivity 

In general, complex adaptive systems, including biological systems, perform their 
functions correctly when certain appropriate communications among nodes are 
established, because such systems need to add or delete nodes, or change the 
connectivity strength to adapt to exogeneous inputs optimally. Therefore, the 
productivity of correlation among nodes is the critical factor for the networks. 
However, the dynamics of our results are collective behavior of the interaction 
of single kind of subgraphs. The lowest correlation productivities in Fig. [SJ; can 
be seen for subgraphs B and F, these two subgraphs do not involve feedforward 
structures [Fig. [T]. The other subgraphs, excluding subgraph A, show greater 
productivity. In fact subgraph C and D have the largest freqencies in signal 
transduction, neuronal networks [314] . 
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